feat: added example FastAPI-based inference server for Qwen-ASR#31
Open
kyr0 wants to merge 1 commit into QwenLM:main from
Conversation
AI-generated garbage

Author:
@RomiVu Have you even tried it? You have 2 contributions this year and you react like this to a working solution that has already gathered a few stars? I really wonder how bitter you must feel. https://github.com/kyr0/fast-qwen-asr-inference-vllm
Addressing #15 and a few other questions, I've implemented, tested, and thoroughly benchmarked Qwen-ASR on a 1x NVIDIA H200 NVL, and came up with this inference server implementation, which is both simple and fully featured. It might serve as a boilerplate for more sophisticated implementations -- I believe it hits a good sweet spot right now: it scales well under load and is configurable, yet still easy to understand. I've also implemented readiness probes and simple monitoring/SRE features. The Forced Aligner is supported as well; every feature documented in the examples folder should be easy to address with this. Also, server.py is volume-mounted, so you don't need to rebuild the container on app code changes -- another DX improvement. The container also caches the model in the host's HF_HOME. Last but not least, I've provided local and remote reference audios that were generated with ... Qwen-TTS :)
I hope this will reduce the number of issues opened due to confusion.
Requirements: the NVIDIA Container Toolkit must be installed on the host (!).
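As a sketch of how one might fail fast when that requirement is missing, here is a small, hypothetical preflight check that probes for nvidia-smi before the server starts. This helper is not part of the PR; it only illustrates the idea.

```python
# Hypothetical preflight check: verify the host exposes NVIDIA GPU tooling
# before attempting to start the inference server. Not part of the PR itself.
import shutil
import subprocess


def gpu_available() -> bool:
    """Return True if nvidia-smi is on PATH and exits successfully."""
    exe = shutil.which("nvidia-smi")
    if exe is None:
        return False
    # nvidia-smi exits non-zero when the driver/toolkit is broken.
    return subprocess.run([exe], capture_output=True).returncode == 0


if __name__ == "__main__":
    if not gpu_available():
        raise SystemExit(
            "No working NVIDIA GPU tooling found -- is the "
            "NVIDIA Container Toolkit installed on the host?"
        )
```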